PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.007G065300.1.p
Common NameSb07g005180, SORBIDRAFT_07g005180
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family HD-ZIP
Protein Properties Length: 782aa    MW: 84368.4 Da    PI: 5.9322
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.007G065300.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox64.31.7e-20100156157
                           TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
              Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57 
                           +++ +++t++q++e+e++F+++++p+ ++r+eL+++lgL+  qVk+WFqN+R+++k+
  Sobic.007G065300.1.p 100 KKRYHRHTQHQIQEMEAFFKECPHPDDKQRKELSRELGLEPLQVKFWFQNKRTQMKN 156
                           688999************************************************995 PP

2START192.81.6e-602995171206
                           HHHHHHHHHHHHHHHHC-TT-EEEE..EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGG...CT-TT-SEE CS
                 START   1 elaeeaaqelvkkalaeepgWvkss..esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddke...qWdetlaka 79 
                           ela +a++el+++a +++p+W   +   +++++e+ + f+ + +      + ea+r+ +vv+m+   lve+l+d ++    ++  + +a
  Sobic.007G065300.1.p 299 ELAVAAMDELLRMARLDAPLWGGGVagVQLDEEEYGRMFPGGLGprqygLRPEASRDNAVVIMTRDSLVEILMDANRfaaVFSSIVSRA 387
                           57899*****************999999***********6665599999999************************88888888888** PP

                           EEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCE CS
                 START  80 etlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksngh 161
                           +t+ev+s+g      galq+m++e+q++splvp R+++fvRy++  ++g+w++vdvS+ds ++ p    v++++++pSg+li++++ng+
  Sobic.007G065300.1.p 388 STHEVLSTGvagsynGALQVMSMEFQVPSPLVPtRESYFVRYCKNNPDGTWAVVDVSLDSLRPSP----VMKCRRRPSGCLIQEMPNGY 472
                           ***************************************************************98....7******************* PP

                           EEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                 START 162 skvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                           skvtwvehv++++r++h+l+r+lv+sgla+gak+wv tl+rqce+
  Sobic.007G065300.1.p 473 SKVTWVEHVEVDDRSVHNLYRPLVNSGLAFGAKRWVGTLDRQCER 517
                           *******************************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.601.2E-2379141IPR009057Homeodomain-like
SuperFamilySSF466897.1E-2084157IPR009057Homeodomain-like
PROSITE profilePS5007117.02297157IPR001356Homeobox domain
SMARTSM003891.1E-1998161IPR001356Homeobox domain
PfamPF000464.2E-18100155IPR001356Homeobox domain
CDDcd000861.46E-19100158No hitNo description
PROSITE patternPS000270132155IPR017970Homeobox, conserved site
Gene3DG3DSA:1.10.10.608.3E-4142184IPR009057Homeodomain-like
CDDcd146860.00822153177No hitNo description
PROSITE profilePS5084838.252290520IPR002913START domain
SuperFamilySSF559615.22E-32291519No hitNo description
CDDcd088751.30E-117294516No hitNo description
SMARTSM002348.4E-55299517IPR002913START domain
PfamPF018525.3E-50300517IPR002913START domain
Gene3DG3DSA:3.30.530.205.2E-6391516IPR023393START-like domain
SuperFamilySSF559612.03E-24539771No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 782 aa     Download sequence    Send to blast
MTPARRMPAM IGRNGMAFGS SSALSLSQAD LLDSHHLQQA FQQQLFDQIP ATAVDSSDNI  60
IHGRSDTLVD EFESKSCSEN PDGTSGDDGQ EDPNQRPNKK KRYHRHTQHQ IQEMEAFFKE  120
CPHPDDKQRK ELSRELGLEP LQVKFWFQNK RTQMKNQHER QENAQLRAEN DKLRAENMRY  180
KEALGTASCP SCGGPAALGE MSFDEHHLRL ENARLRDEID RISGIAAKHV GKPMVSFPVL  240
SSPLAAAAAR SPLDLAGAYG VQPGGLGADH LFGVGAGAGD LLRSVSTGQL DADKPMIVEL  300
AVAAMDELLR MARLDAPLWG GGVAGVQLDE EEYGRMFPGG LGPRQYGLRP EASRDNAVVI  360
MTRDSLVEIL MDANRFAAVF SSIVSRASTH EVLSTGVAGS YNGALQVMSM EFQVPSPLVP  420
TRESYFVRYC KNNPDGTWAV VDVSLDSLRP SPVMKCRRRP SGCLIQEMPN GYSKVTWVEH  480
VEVDDRSVHN LYRPLVNSGL AFGAKRWVGT LDRQCERLAS AMASNIPNGD LGVITSIEGR  540
KSMLKLAERM VASFCGGVTA SAAHQWTTLS GSGAEDVRVM TRKSVDDPGR PPGIVLNAAT  600
SFWLPVPPKR VFDFLRDETS RSEWDILSNG GAVQEMAHIA NGRDHGNCVS LLRVNSANSN  660
QSNMLILQES CTDASGSYVV YAPVDVVAMN VVLNGGDPDY VALLPSGFAI LPDGPPGMAP  720
HGEGAALETG GGSLLTVAFQ ILVDSVPTAK LSLGSVATVN SLIACTVERI KAAVCVEGNP  780
Q*
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Sbi.123200.0panicle
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankBT0663760.0BT066376.1 Zea mays full-length cDNA clone ZM_BFc0032A06 mRNA, complete cds.
GenBankKJ7275970.0KJ727597.1 Zea mays clone pUT5457 HB transcription factor (HB97) mRNA, partial cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002443965.10.0hypothetical protein SORBIDRAFT_07g005180
SwissprotQ6ZAR00.0ROC1_ORYSJ; Homeobox-leucine zipper protein ROC1
TrEMBLC5YI050.0C5YI05_SORBI; Putative uncharacterized protein Sb07g005180
STRINGSb07g005180.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP79938147
Representative plantOGRP14515136
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G21750.20.0HD-ZIP family protein